Probabilistic Planning with Reduced Models

نویسنده

Luis Enrique Pineda

چکیده

Markov decision processes (MDP) (Puterman 1994) offer a rich model that has been extensively used by the AI community for planning and learning under uncertainty. Some applications include planning for mobile robots, network management, optimizing software on mobile phones, and managing water levels of river reservoirs. MDPs have polynomial complexity in the size of the state space, but the state space itself is exponential in the description size. Therefore, algorithms that try to find complete optimal plans are often impractical. Developing effective ways to tackle this complexity barrier is a challenging research problem. Determinization-based algorithms for solving MDPs have gained popularity in recent years (Yoon et al. 2008; Teichteil-Königsbuch et al. 2010; Keyder and Geffner 2008), motivated by the surprising success of the FF-Replan solver (Yoon et al. 2007). The main idea is to generate a deterministic version of the underlying MDP and solve it using a classical deterministic planner, resulting in a partial plan for the original problem. When confronted by an unexpected state during plan execution, the planning process is repeated using the current state as the initial state. The advantage of this approach is its ability to quickly generate partial plans, particularly in intractable probabilistic domains. Despite their success, determinization-based algorithms have drawbacks because they consider action outcomes in isolation. This leads to an overly optimistic view of the domain and can result in plans arbitrarily worse than optimal. Furthermore, even when optimal plans could be obtained using isolated outcomes, it is not always clear, nor intuitive, which outcomes should be included in the determinization. In my work I introduce and study a more general paradigm in which the single-outcome variant of FF-Replan is just one extreme point on a spectrum of MDP reductions that differ from each other along two dimensions: (1) the number of outcomes per state-action pair that are fully accounted for in the reduced model, and (2) the number of occurrences of the remaining outcomes that are planned for in advance. Similar treatments of exceptional outcomes have been explored in fault-tolerant planning (Jensen et al. 2004; Domshlak 2013; Pineda et al. 2013).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Probabilistic Integrated Planning of Primary and Secondary Distribution Networks based on a Hybrid Heuristic and GA Approach

The integrated planning of distribution system reveals a complex and non-linear problem being integrated with integer and discontinues variables. Due to these technical and modeling complexities, many researchers tend to optimize the primary and secondary distribution networks individually which depreciates the accuracy of the results. Accordingly, the integrated planning of these networks is p...

متن کامل

On the Undecidability of Probabilistic Planning and Infinite-Horizon Partially Observable Markov Decision Problems

We investigate the computability of problems in probabilistic planning and partially observable infinite-horizon Markov decision processes. The undecidability of the string-existence problem for probabilistic finite automata is adapted to show that the following problem of plan existence in probabilistic planning is undecidable: given a probabilistic planning problem, determine whether there ex...

متن کامل

On the Undecidability of Probabilistic Planning and Innnite-horizon Partially Observable Markov Decision Problems

We investigate the computability of problems in probabilistic planning and partially observable innnite-horizon Markov decision processes. The undecidability of the string-existence problem for probabilistic nite automata is adapted to show that the following problem of plan existence in probabilistic planning is undecidable: given a probabilistic planning problem, determine whether there exist...

متن کامل

Probabilistic Models in Planning An overview

Planning has been one of the main research areas in AI. For about three decades AI researchers explore alternative paths to build intelligent agents with advanced planning capabilities. However, the classical AI planning techniques suffer from inapplicability to real world domains, due to several assumptions adopted to facilitate research. Attempts to apply planning into real domains must addre...

متن کامل

On the Undecidability of Probabilistic Planning and In nite-Horizon Partially Observable Markov Decision Problems

متن کامل

A Probabilistic Approach to Transmission Expansion Planning in Deregulated Power Systems under Uncertainties

Restructuring of power system has faced this industry with numerous uncertainties. As a result, transmission expansion planning (TEP) like many other problems has become a very challenging problem in such systems. Due to these changes, various approaches have been proposed for TEP in the new environment. In this paper a new algorithm for TEP is presented. The method is based on probabilisti...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2014

Probabilistic Planning with Reduced Models

نویسنده

چکیده

منابع مشابه

Probabilistic Integrated Planning of Primary and Secondary Distribution Networks based on a Hybrid Heuristic and GA Approach

On the Undecidability of Probabilistic Planning and Infinite-Horizon Partially Observable Markov Decision Problems

On the Undecidability of Probabilistic Planning and Innnite-horizon Partially Observable Markov Decision Problems

Probabilistic Models in Planning An overview

On the Undecidability of Probabilistic Planning and In nite-Horizon Partially Observable Markov Decision Problems

A Probabilistic Approach to Transmission Expansion Planning in Deregulated Power Systems under Uncertainties

عنوان ژورنال:

اشتراک گذاری